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WIDTH AND MODE OF THE PROFILE FOR SOME RANDOM 
TREES OF LOGARITHMIC HEIGHT 

By Luc Devroye 1 and Hsien-Kuei Hwang 1 ' 2 
McGill University and Academia Sinica 

We propose a new, direct, correlation-free approach based on cen- 
tral moments of profiles to the asymptotics of width (size of the most 
abundant level) in some random trees of logarithmic height. The ap- 
proach is simple but gives precise estimates for expected width, cen- 
tral moments of the width and almost sure convergence. It is widely 
applicable to random trees of logarithmic height, including recursive 
trees, binary search trees, quad trees, plane-oriented ordered trees 
and other varieties of increasing trees. 

1. Introduction. Most random trees in the discrete probability literature 
have height either of order y/n or of order logn (n being the tree size); see [1]. 
For simplicity, we call these trees square-root trees and log trees, respectively. 
Profiles (number of nodes at each level of the tree) of random square-root 
trees have a rich connection to diverse structures in combinatorics and in 
probability, and have been extensively studied. In contrast, profiles of ran- 
dom log trees, arising mostly from data structures and computer algorithms, 
were less addressed and only quite recently were their limit distributions, 
drastically different from those of square-root trees, better understood; see 
[3, 12, 13, 21, 27] and the references therein. 

We study in this paper the asymptotics of width, which is defined to be the 
size of the most abundant level, and its close connection to the profile. There 
are many results on first-order asymptotics of profiles for standard log trees, 
such as binary search trees, random recursive trees, m-ary search trees and 
quad trees. In some cases, quite accurate asymptotic expressions are known 
for the expected profile. There is already a paucity of results with regard to 
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higher moments of the profile, let alone limit laws for properly centered and 
scaled profiles. The literature on this subject was surveyed by Drmota and 
Hwang [12, 13] and Fuchs, Hwang and Neininger [21], but some key historical 
references are repeated here. In fact, the last paper describes the complete 
asymptotics for the profile of random recursive trees and random binary 
search trees, two important representatives. The present paper extends the 
results to a universal class of log trees called width-regular. It adds a host 
of new results on the width of these trees, such as its exact location up to 
O(l) as well as tight estimates on the central moments of the width and some 
strong laws of large numbers. Equally important is the fact that these results 
are obtained from simple moment estimates. For example, the correlation 
between the profiles at different levels is not needed at all in the study of 
the width. 

Recursive trees. A prototypical log tree is the recursive tree, which has 
been introduced in diverse fields due to its simple construction. We will 
present our methods of proof for recursive trees and then indicate the re- 
quired elements needed for other random trees. 

Combinatorially, recursive trees are rooted, labeled, nonplane trees such 
that the labels along any path down from any node form an increasing se- 
quence. By random recursive trees, we assume that all recursive trees of n 
nodes are equally likely. Probabilistically, they can be constructed by suc- 
cessively adding nodes as follows. Start from a single root node with label 1. 
Then at the iih stage, the new node with label i chooses any of the previous 
i — 1 nodes uniformly at random [each with probability l/(i — 1)] and is 
then attached to that node. This construction implies that there are (n — 1)! 
recursive trees of size n. The first paper on such tree models we could find 
is Tapia and Myers (under the name of concave node- weighted trees); see 
[45] and [13, 21, 43] for more references on the literature of recursive trees 
and their uses in other fields. 

Note that the term "recursive trees" (first used in [35] and [37]) is less 
specific and has also been used in different contexts for different objects. 
For example, they are used in recursion computation theory to represent a 
computable set of strings with branching structure and in compilers to record 
the history of recursive procedures. They also appeared in classification trees, 
dynamic systems and data base languages with a different meaning. 

Profile. Let Y n ^ denote the number of nodes at distance k from the 
root in random recursive trees of n nodes (the root being at level zero). 
Such a profile is very informative and closely related to many other shape 
parameters, although it does not uniquely characterize the tree. 

The combinatorial sister of the random recursive tree is the random binary 
search tree. Early work by Lynch [30] and Knuth [28] (see also [42]) showed 
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that the expected profile of random binary search trees is related to Stirling 
numbers of the first kind and it peaks at k about 2 log re. It peaks at logn + 
0(1) in random recursive trees. In contrast, the random variable Y n ^ has 
received less attention until recently. 

The profiles of random binary search trees and random recursive trees 
exhibit many interesting phenomena such as (i) bimodality of the variance, 
(ii) different ranges for convergence in distribution and for convergence of 
all moments of the normalized profile Y n k/M{Y n fc}, (hi) no convergence to 
a fixed limit law at the peak levels and (iv) sharp sign changes for the cor- 
relation coefficients of two level sizes; see [13, 21] for more information. See 
also [27], where the limit distribution of the profile of random binary search 
trees was first studied, and [3], where the width was addressed. However, 
their approach, which is based on martingale arguments, is very different 
from the moment approach used in this paper. 

For simplicity, write throughout this paper L n := maxjlogn, 1}. The ex- 
pected profile ^L n: k :=E{y nj fc}, which gives the first picture of the general 
silhouette of random recursive trees, is known to be enumerated by the 
signless Stirling numbers of the first kind (see [43] and [21]) 

^ = n ( 1 + 7) = ( n-l 

k i<i<n J 

From this, it follows by the saddle point method that 

"•-^ e " iV,21 " ,+0<ia|3/L?J ( 1+0 ( i ^)) 

uniformly for k = L n + 0(L n ), where, here and throughout this paper, A := 
k — L n . The asymptotic approximation (1.1) is crucial for our analysis. It 
means that most nodes in a random recursive tree are located at the levels 
with k ~ L n . In particular, we have 

(1.2) max/i nifc = -^=(l + 0(L- 1 )); 

see [46] or [25] for more precise expansions for [i n ^- 

Expected width. We define the width of random recursive trees to be 
W n := maxfc l^fc. The approximation (1.1) can be interpreted as a local 
limit theorem for the depth, which is the distance to the root of a randomly 
chosen node in recursive trees (each with the same probability). Thus it is 
intuitively clear that the width will be roughly close to n/^2nL n . Our first 
result gives a more precise description of this. 

Theorem 1.1. The expected width satisfies 

(1.3) E{^} = -^==(l + e(L- 1 )). 
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This theorem improves upon the error term 0(L n logL n ) given in [13], 
where the proof depends on estimates for correlations of two level sizes and 
a tightness argument for the process. The approximation (1.3) also says, 
when compared with (1.1), that 

nWn}=Vn,L n+ 0(l)(l + 0(L- 1 )). 

In particular, by (1.2), 

E{maxY n>fc l = maxE{y n . fc }(l + O(L^)). 

{ k J k 

Note that the index k reaching the maximum of \x n ^k satisfies 

(1.4) k= \L n -i + 1 + o{L~ 1 )\- 

see [22] or pages 140-141 of [24]. Erdos [14] showed that k is unique. 

An estimate for absolute central moments. We see from (1.3) that the 
expected width is asymptotically of the same order as the expected level sizes 
at k = L n + 0(1). We show that not only their expected values are of the 
same order, but also all higher absolute central moments are asymptotically 
close. 

Theorem 1.2. For any s > 0, 

(1.5) E{\W n ~ K{Wn}\ s } = 0(n s L- 3s ' 2 ). 

From [21], we have 

(1.6) E{(Y n ,k- f in,k) m } = 0(\A\ m L- m v™ k ) (k = L n + o(L n )). 

By Lyapounov's inequality (see page 174 of [29]), we obtain, for any s > 0, 

(1.7) E{|y nifc -/j nifc | s } = 0(|A| s L;Vn,fc) (k = L n + o(L n )). 
In particular, this implies, by (1.1), that 

E{| W n - E{W n }\ s } = 0(E{|y n , fc - fi nik \ s }) (s > 0) 
for k = L n + 0{l). 

Almost sure convergence. In sequential growth models of random trees, 
such as the incremental model for random recursive trees described above, it 
makes sense to study almost sure convergence properties. One can also define 
sequential growth in random binary search trees and most other log trees 
discussed in this work. This is because most definitions derive from data 
storage applications, where the sequential insertion of new data in trees is a 
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natural way to grow them. For random recursive trees, we use (1.5) to show 
that 

(1.8) — ' 1 almost surel y- 

This result was proved in [13] by martingale arguments and complex anal- 
ysis, following [3]. Our proof relies on (1.5) with s = 2 + e and the usual 
Borel-Cantelli argument. It is conceptually simpler and also applies to ran- 
dom trees for which no martingale structure is available. 

Level reaching the width. Let k* denote one of the levels such that Y n ^* = 
W n . We show that k* takes most likely the values L n + 0(1). 

Theorem 1.3. For every B > 0, there exists Tq > 1 such that 
F(\k* -L n \ >T) = 0(T~ B ) 

forT>T . 

Thus width will with very small probability lie outside the range L n + 
0(1). 

Approaches used. The most notable feature of our method of proof is that 
with the two crucial estimates (1.1) and (1.6) at hand, only basic probabil- 
ity tools such as Markov and Chebyshev inequalities and the Borel-Cantelli 
lemma are used. However, asymptotic tools for proving the two estimates 
for general random trees may differ from one case to another. In most cases 
we considered, the estimate (1.1) is proved by a combination of diverse an- 
alytic tools such as differential equations, singularity analysis (see [19]) and 
the saddle point method. The remaining analysis required for higher cen- 
tral moments of the profile is then mostly elementary, since this corresponds 
roughly to the large "toll-functions" cases for the underlying bivariate recur- 
rences; see [21]. Although tools for handling the width of random square-root 
trees are very different from those for random log trees considered in this 
paper, the profiles provide in both classes of trees an accessible route to the 
asymptotics of the width; see [11] and the references therein. 

Generality of the phenomena. The treatment for random recursive trees 
can be extended to width-regular trees, a large class of log trees defined below. 
This class includes familiar trees such as random m-ary search trees, quad 
trees, grid trees and increasing trees. To check whether a tree is width-regular 
is done case by case, unfortunately, because we need uniform estimates on 
the expected profile. A universal asymptotic tool is still lacking to extend 
the results, for example, to all random split trees [8]. 
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Organization of the paper. For self-containedness and to pave the way 
for general random trees, we give a sketch of the proof for (1.1) and (1.6) in 
the next section. We then prove the theorems in Section 3. Extension of the 
same arguments to other log trees is given in Sections 4-7. 

Notation. Throughout this paper, the generic symbol e > always 
represents a sufficiently small constant whose value may differ from one 
occurrence to another. Also L n := maxjlogn, 1} and A := k — L n . 

2. Estimates for the profile moments. We briefly sketch the main ideas 
that lead to the estimates (1.1) and (1.6); see [21] for details and more precise 
estimates than (1.6). 

Recurrence ofY n ^ . By construction, the profile of random recursive trees 
satisfies the recurrence 

v ± sr I \- f n ~ 1 \ C?i -!)'••• (is -i)' 

' k ~ih s] -4. h 1 W' .iJ (n-l)! 

l<s<n jiH hj s =n-lyj v , 

jl )•• • ijs ^0 p / ^} 1C roo t degree equals s and \ 

Vthe s subtrees have sizes j ± , . . . ,js f 

x (*$-!+ -+i£Li) 

(i) 

for n > 2 and k > 1 with Y\ o = 1, where the ^'s are independent copies of 
y^fc. From this we deduce, by conditioning on the size of the first subtree, 
that 

(2.1) Y ntk I y /n , fe _a + y;_ /n>fc ( n > 2 ; > i), 

with Y\ t Q = 1, where the y^^'s are independent copies of Y n> k and indepen- 
dent of I n , which is uniformly distributed in {1, . . . ,n — 1}. 

The expected profile and the expansion (1.1). From (2.1), we derive, by 
taking expectation and by solving the resulting recurrence, the relation 

see [10, 36, 43]. Then by singularity analysis (see [19]), 

(2.2) J2^, k u k = fir^si 1 + o(M(i + Hh' 1 )), 

k ^ ' 

where the O-term holds uniformly for \u\ < C for any C > 0. Note that the 
Stirling formula with complex parameter for the Gamma function does not 
give the required uniformity in u. 

The uniform approximation (1.1) is then obtained by Cauchy's integral 
formula using (2.2) and the saddle point method. 
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A uniform estimate for pL n ,k- A very useful uniform estimate for [i n ^ is 
given by 

(2.3) /v fc = 0(L- l ' 2 r- k n r ) (0 < r = O(l)) 

uniformly for all < k < n. This is easily obtained by Cauchy's integral 
formula and (2.2) since 



/in,fe = O (r~ k n r £ n^ 1 "^ dt) . 



which gives (2.3). Throughout this paper, r is always taken to be r = 1 + o(l) 
unless otherwise specified. 
Although one can prove that 

(2-4) ^ fc = (jl 

uniformly for < k < n, the reason for using the estimate (2.3) instead of 
(2.4) is that, for general random search trees, it is much harder to derive the 
Poisson type estimate (2.4) for all k. 

Recurrence of higher central moments. Let P^ = E{(l^ 5 fc — fJ- n ,k) m }- 
Then P^} satisfies, by (2.1), the recurrence 

p (m) _ _ p( m ) i p( m ) i _i_ r>( m ) 

r n,k ~ _ ^ V j,k-l " r r n-j,k> ^n,k ' 

l<j<n 

with P^f = for n,m> 1, where 

(a,6,c)6X m V ' ' y l<j<n 

with V„,fc(j) := /Xj,jfe_i + - Mn,fe and 

:= {( a > &jc) £ I* 3 : a + b + c = m,0 < a, b < m,0 < c < m}. 
We prove (1.6) in two stages. A uniform estimate for ^7 n ,k(j) for 1 < j, k < 
n is first derived, which then implies by induction a uniform bound for 
for 1 < k < n. This bound is, however, not tight when A = o(y/L n ). Then 
we refine the estimate for V n ,k{j) when A = 0(y/L^), which then leads to 
(1.6) by another induction. 

First estimate for Pj^l . By (2.2), we have the integral representation 

1 / u~ k ~ x n u 
V n ,fc(j) = — * ———ip{u\i/n) 
2m J\u\=r T(l + U) 

(2.5) 

x (l + O^-i + ^-j)- 1 ))^, 
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where tp(u; x) := ux u + (1 — x) u — 1. Since <p(l; x) = 0, we have 

r(i + «) Vl |; 

uniformly for x G [0, 1]. Substituting this estimate into (2.5), we obtain 
^n,h{j) = o(r~ k n r J* \re ie - l\n- r ^- cose Ue^ 

(2.6) 

= 0{(\r-l\+L- 1 l 2 )L- 1 l\- k n r ) 

uniformly for 1 < j, k <n, where r = 1 + o(l). This bound is not tight for 
all k, but is sufficient for most of our purposes. In particular, since r is 
not specially chosen to minimize the error term, (2.6) is not optimal when 

— 1/2 

\r — 1| = o(L n ), which is the case when we choose r = k/L n and \k — L n \ = 

We now prove by induction that 
(2.7) pj$ =0((\r-l\ m + L- m / 2 )L~ m / 2 r- km n mr ) (m > 0) 
uniformly for 1 < k < n. 

Obviously, (2.7) holds for m = 0, 1. Assume m > 2. To estimate Q^k , we 
split the sum into two parts, 

QS = X (a b c) ^3t( X + X ) P ifcU P i-i,fc V n,fc(i)> 
(a,6,c)ex m v ' ' J \jaJm jejj 

where J" m := {j:n/L™ <j < n-n/L%} and J' m := {1, . . . , n- 1} \ J m . Then 
by induction and (2.6), the terms in with j £ J' m are bounded above 
by 



-a/2 jar 



V (a,6,c)el m V j<n/L™ 

+ L -(a+c)/2 n (a+c)r L J b/2 j br )) 

= 0{L^ m l 2 r- mk n mr ) 

uniformly for 1 < k < n. 

On the other hand, when j G J m , we have Lj ~ L n -j ~ ^n! thus by in- 
duction and the two estimates (2.6) and (2.7), 

m \ I ^ (a) D (6) 



(a,6,c)SXm j&Jm 



0{{\r-l\ m + L- m ' 2 )L- m l 2 r- 



■km mr\ 
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it follows that 

(2.8) Q<$ = 0((|r - l| m + L- m l 2 )L- m l 2 r~ km n mr ) 

uniformly for 1 < k < n. 

From [21], we have the closed-form expression 

r>( m ) , x 

M p£?=qS+ E E %V](« + i) n (i + r). 

I<j<n0<l<k J j<h<n^ ' 

where [u \F(u) denotes the coefficient of u in the Taylor expansion of F. 
Substituting the estimate (2.8), we obtain 

4 ) =o(gS+^ m e (ir-ir+L7 m/2 )L7 m/2 r r_1 

V l<j<n 



X 

0<^< 

Now 

it 



E r"V](« + i) n (i + r))- 

<£<fc j<kti v 7 / 



E r™VK« + l) II 1 + f )<(l + r m ) n 1 + 



n x 



O 

J 

Thus (2.7) follows. 

When k ~ L n , we take r = k/L n in (2.7), giving 

which proves (1.6) when vLn < |A| = o(L n ). 

Proof of (1.6) u;/ien A = 0(y/L^,). We now refine the above procedure 
and prove (1.6) when A = 0(y/L n ), which has the form 

(2.10) = 0(\A\ m L~ 3m / 2 n m ) (m>0). 

By applying the expansion 

cp(u;x)=<p' u (l;x)(u-l)+0(\u-l\ 2 ) (x€ [0,1]) 

and the usual saddle point method to (2.5), we deduce that 

(2.H) V n<k (j) = 0(\A\L- 3 / 2 n) 

uniformly for A = 0(\fL^_) and 1 < j < n. Note that this estimate also 
follows from (1.1). 
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We apply the same inductive procedure used to prove (2.8). By applying 
(2.7) to terms with j E J' m and (2.10) to terms with j E J m [starting from 
(2.11)], we have 

Q$=0(\ArL- 3m / 2 n m ) (m>2) 

uniformly for A = 0(^/17^. This estimate and (2.8) gives, by (2.9), (2.10) 
and a similar decomposition of the sums involved, 

p$=o(qS+Y, E ik-t-L^Lf^r-v 



ote+E E 

V j&Jm o<e=o(i 

+1) n H)) 

j<h<n v 7 / 



X (u ■ 



0(\A\ m L- 3m / 2 n m ). 



This completes the proof of (2.10). 

Such a two-stage proof of (1.6) is completely general when we have an 
integral representation for V nj fc(j) of the form (2.5) and a closed-form similar 
to (2.9). We will sketch means to handle the cases when no closed-form 
solution like (2.9) is available. 

3. Asymptotics of the moments of the width. We first prove Theorem 1.1; 
then we extend the proof for (1.5) and finally prove Theorem 1.3. 

3.1. Expected width. 

Lower bound for the expected width. The lower bound follows easily from 
the inequality 

E{W n }>M n: 

where 

M n := maxE{Y nifc } = Jl— (1 + C>CO); 
k \J 2irL n 

see (1.2). 

An inequality for the upper bound. For the upper bound, we use the in- 
equality 

E{W n }<M n + ]T E{(y n , fc -M n )+}+ Vn,k 

\A\<K \&\> K 



where K := lV 2 ' . 
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The sum u>n ■ The last sum is easily estimated, since by (2.3), 



w 



i^o ^v E + E Y k 

\ \0<k<L n -K k>L n +K/ J 



Taking r = 1 — L n 1 ^ 3 , we see that 



/ r -L n +K\ 

E ^ = 0[L-Wn* T -—) 

0<k<L n -K 



= o(L^/v^ 1/3 Ly 3 (i - l- 1 / 3 )- 1 ^ 3 ) 

= 0(nL- 1 /6 e -^ /3 /2 ) . 

— 1/3 

The same upper bound holds for J2k>L n +K Mn.fc by taking r = 1 + L n 

(2) 

An estimate for the second sum Wn ■ We use the inequalities 
E{(y n>fc - M n )+} < E{(Y n , k - ii n , k )l {Yn k>Mn )} 

< E{(y ra , fc -^ n , fc ) 2 } 

for those /c's for which M n > fj, n k . By (1.1), 

M n - ^ k = -^=(1 - e -AV(^) + 0(|A|3/^) )(1 + 
\/2TTL n 

n -A 2 /(2L n ) 



:(l-e" 



(3.1) 

+ 0(e- A2 /( 2L ")|A| 3 L^ 2 ))(l + o(l)) 



>-^=(l-e-A 2 /(3^) )(1 + o(1)) 

uniformly for 1 < | A| < K. On the other hand, the variance is bounded above 
by 

(3.2) Y{Y n , k } = 0(A 2 L~ 2 ^ k ) = 0(A 2 L^V e - A2 / i ") 

uniformly for 1 < |A| < K. It follows from these estimates that 

V{F n , fc } 



l<\A\<K Mn ^ n ' k 



OinL- 1 ). 
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Collecting all estimates, we get a weaker error term than (1.3), 



(3.3) E{T y ri} = _^_ ( i + (L- 1 /2 ))) 
but we only used estimates for E{1^ and Vjl 7 ^}. 

Improving the error term by the fourth central moment ofY n ^. We can 
improve the error term in (3.3) by using the estimate for the fourth central 
moment of Y n ,k] see (1.6). Taking m = 4 in (1.6) and repeating the same 
analysis as above, 

,„(2) <r fvrv u \' E i( Y n,k ~ /Vfc) 4 } 

1<|A|<K ^ n 

/ /-oo 4 -2x 2 /L n \ 

= 0{nL-^) + 0(nL-^j i ^-^— ^ 

{ f°° v^/ 2 e~ 2v 

= 0(nL^) + 0{nL; 2 l /LnJT -^ dv 

= 0{nL-V 2 ) + o(nL- 2 Q v~ 3 / 2 dv^j 

= 0{nL^I 2 ). 
This proves (1.3). 

3.2. Higher absolute central moments of W n . We prove only an upper 
bound for s = 2, namely for the variance of W n ; other values of s follow by 
the same argument and Lyapounov's inequality. 

An upper bound for the variance of the width. We show, by using central 
moments of Y n ,k of order 6, that 

(3.4) Y{W n } = 0(n 2 L~ 3 ), 

which proves (1.5) with s = 2. 

The proof extends that for E{W n }. Define ko = \ L n \ . We start from 

E{(W n - E{W n }) 2 } = E{{W n - fi nM + fi nM - E{^ n }) 2 } 

<2E{(W„ f} + 2E{( f i nM) -E{W n }) 2 }. 

By (1.3), 

E{(i^ nM -E{W n }) 2 } = 0(n 2 L- 3 ) 
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and, similarly to the analysis for E{W n }, 



E{(W n - Mn,fc ) 2 } < E E ( Y n,k +A ~ (J>n,k Q )+ • 1 (y n ,* +A>A«„,» ) \ 

l|A|>0 J 



By (3.2) 



■|A|: 

<V{K„, fc() }+ Y, n(Yn,k +A-fln,k f + } 
1<|A|<JC 

+ E v{y n , fc0+A } 

|A|>if 



(2) (2) 

The estimation of Vn follows mutatis mutandis from that for Wn by using 
(1.6) with 171 = 6: 

v (2) < V- n(Yn,k-Vn,k?} 
1<|A|<A' (^ n > fc o ~~ /^n.feo+Aj 

/ A 6 g -3A 2 /in \ 

= OU 2 L- 7 ^ (1 _ e -AV(3L„))4 J 
\ 1<|A|<A' V e ' / 



r-OO 

'l/L n 



-°(" 2i " 7/2 /; 

For the last term Vn , we use again (2.3), 

V{F re)fc }</4 )fe = 0(L„-r -n- , 

uniformly for 1 < < n, where r > is any bounded real number. Substi- 
tuting this into v gives 



V IAI>j 



r -2fc -2A 



|A|>* 
0(L 2 J 3 r 2 ^ /a 



n e 

— 1/3 

by taking r = 1 + sign(A)L„ 

This completes the proof of (3.4). 
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Higher central moments of W n . The same analysis can be carried out 
for higher absolute central moments using (1.7). Then the same proof for 
V{W n } gives (1.5) by using (1.7) with order 2s + 2. 

Almost sure convergence. We need first a tail bound for the width. By 
Markov's inequality (see page 160 of [29]; sometimes referred to as Cheby- 
shev inequality), 

E{\W n -E{W n }\ s } 



P{|W- n -E{W n }| >eE{W n }}< 



(eE{^ n })* 
0(e~ s L~ s ) 



for any s > and e G (0, 1) . 

From this estimate it follows, by applying the Borel-Cantelli lemma and 
by taking s > 2, that 

EO^J — * 1 alm ° St SUrely ' 

where n t := [e^~ e \, since EeK* = 0(E^~ s/2 ) = O(l). 
Now observe that 

n t+1 -n t = 6(n,r 1 / 2 ) = Q(n £ L^) = Q{E{W nt }L~ l J 2 ). 

On the other hand, by construction, adding a new node to random recursive 
trees affects the value of W n by at most 1. Consequently, 



sup max(|^ n - W ne \,\E{W n } - E{Wn e }\) < n l+1 - n £ 

= @(E{W n \L-]' 2 ). 



n e <n<n t+1 



So, deterministically, 

W„ Wr, 

o 



sup 



n e <n<n e+1 



W n W nt EiWnjLr 1 ' 



E{W n } E{W n \ 



E{Wn e }-(n e+1 -n e ) 

= o(L-y 2 ) 

= o(r 1 / 4 ). 

This completes the proof of (1.8). 

An alternative form to (1.8). The same argument can be modified to 
show that 

Wr, 1 , r. 
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almost surely for any fixed 5 > 0. The proof is modified from that for (1.8) 
as follows. By (1.3), we have 

^^ = ^ T (i + o(^ 1 ))- 

n/v27rL n M^n} 

Instead of n e := \e^~ l \ , we now take rt t := [e^^ 2 ~ s ^\ . Then, setting e = e n = 
L~ 1+s in the proof, we deduce that, again by the Borel-Cantelli lemma, 

almost surely as £ — > oo provided that e~ s L~ s is summable in £. This forces 
the choice s > 2/8. Next, 

n m -n, = 6(E{^ n jr 1 / 4 ). 

This proves (3.5). 

Almost sure convergence for Y n ^. We can also obtain strong conver- 
gence by the same argument for the profiles Y n ^ in the central range [L n — 
L x ~ e ,L n + where e G (0, 1). We prove that 



(3.6) 

almost surely. 



sup 



Y, 



E{y„,«} 



i 



o 



PROOF. Set t n := 2L\ 6 and n t := [e^\. Using (1.7) and Markov's in- 
equality used above, it is easy to see that 



sup 



Y 



0. 



almost surely as I — > oo. By the union bound and the Borel-Cantelli lemma, 
this requires that we take s so large that L~ s t}^ s is summable in L Any 
choice with s > 3/e— 1 suffices for that purpose. Furthermore, by the mono- 
tonicity of Y n & in n for fixed k, 

sup sup \Y n>K -Y nvK \ 



< 



sup |^n <+1) « Y n ^f\ 



and 



sup 



sup 



\E{Y ntK }-E{Y n , ,}| 



< sup \E{Y ne+vK }-K{Y ni>K }\. 
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Thus, 



sup 

n.<n<n. 



Y 

1 n.K 



Y 



E{Y n , K } E{Y n A 



< 



Yn, , , .k Y n 



E{Y n ,} E{Y n A 



Y, 



< 



n e+1 ,K 



Y n 



+ 



E{Y n K } E{Y nvK } \E{Y nvK } jE{Y n K } 



Yn 



Putting the supremum over L n — t n ^ K ^ L n + t n in front of all of the 
latter inequalities, we see that both terms tend to zero almost surely provided 
that 



lim 



sup 



1 



0. 



This follows from an extension of the Taylor series estimate used in (1.1); 
indeed, the estimate (see [25]) 



L 



F(l + k/L n )k\ 

is sufficient for our use. 
Thus we have shown that 



{1 + 0{L~ 1 )) (k = 0(L n )) 



sup 



sup 



n e <n<n t+1 



Y 

1 n,K 



E{y„,„} 



i 



o 



almost surely. An additional argument shows that for £ large enough, [L n — 
L n ~ e ', L n + L n ~ £ ] is contained in [L n( - tn e ,L nj; + t n J for n e <n< n i+1 , thus 
concluding the proof of (3.6). □ 

3.3. Level reaching the width. We now prove Theorem 1.3. For |A| > 
\k — ko\ and B > 1 , 

F(k* = k + A) = F(W n = Y nM+A ) 

,ko) 

< Pf^n,fe +A — Mn,feo+A > ^(^n^o ~ Mn,fco+A)^ 
+ ff > ^^n,fc ~ Hn,k < ~l^{t L n,k ~ Mn,fco+A)^ 



,fc +A — Mn,fc +A 



l-B 



(Mn,fc ~~ ^n,k +A) B 



{^n,ko ~ f>n,k +A) B 
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by Markov's inequality. By (1.1), we obtain an estimate similar to (3.1) for 
fJ-n,k ~ AVfco+A, which together with (3.2) gives 

/A Br-B-BA 2 /(2L n ) t-B 

* = k + A) = O - " * r _ UH + 



(1 _ e -A 2 /(3L n )\B 1 M _ e -A 2 /(3L„))B 

= 0{A~ B + A- 2B V L~ B ) 
= 0(A~ B ) 

uniformly for 1 < | A| < K. It follows that there exists a To > 1 such that for 
T>T , 



\k* — k 



o\>T)=o( y, A j +n\k*-ko\ >K) 

\T<\A\<K J 

= 0{T 1 - B )+¥(\k* -k \>K). 

The tail probability ¥(\k* — ko\ > K) is estimated as follows, where we let 
ki := [y/L^\: 

n\k*-k \>K)<F( max Y n>k > Y n>ko 

\\k-k \>K 

< P( max Y n)k > Hn,k +ki ) + ^(Yn,ko < /W'o+fci) 
\\k-ko\>K / 

- VnM+ki 2^ Vn,k ^ (,.,_,, , , \2 
\k-k \>K ^ n - k ° VnM+klJ 

= 0(L^ e ~^/2 + L - % 

which tends to zero as n — > oo, where we used again (2.3) to bound 
J2\k~k \>K A*n,Jfc- Since -B > 1 is arbitrary, this proves Theorem 1.3. 

Limit distribution of W n ? It is known that the centered and normalized 



random variables (Y n>k — fj, n>k ) / J V{Y nj fc} do not converge to a fixed limit law 
when k = L n + 0(1) due to periodicity; see [21]. The origin of the periodicity 
lies at the second-order term in the asymptotic expansion of Mn,i»+0(i)) 

^ k ° +i - -^m v + ~L~ n + 0(L - 

where {x} denotes the fractional part of x and 
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This periodic second-order term is the origin of all fluctuations of higher 
central moments. Note that 

/o >y\ I \ / P-i(%), if xG [0,1 -7], 

(3.7) max pAx) = S / \ - f _ f-i n 

i&l * v lPo( x )> if x €[1-7, 1J; 

ae[o,i] 

compare (1.4). 

The main open question is the limit distribution (if it exists) of W n . 
Simulations seem to indicate the closeness of the histogram of W n to that 
of Y U: k -i when {L n } < 1 — 7 and to that of Y n> k when {L n } > 1 — 7; see 
Figure 1. 



4. Width of general random log trees. Our methods of proof for recur- 
sive trees can be formulated in a general, simple framework described below, 
which gives sufficient conditions we need to obtain asymptotics of the width 
of general random log trees. We first describe the estimates we need to han- 
dle the width of general random log trees and then discuss a few concrete 
examples in the remaining sections. 



4.1. An analytic scheme for the profile and width. We start from a gen- 
eral framework for the moments of the profiles of random log trees. The 
same notations as those for recursive trees are used in this section. So we 
denote by Y n ^ the profile of the random log trees in question, the initial 
condition being immaterial for our purpose. We impose the following three 
conditions. 



Condition I (Quasi-power form for the expected profile polynomial). 
The generating polynomial of the expected profile /t% k '■= ^{Y n ^} of the 
random log trees in question satisfies asymptotically 

(4.1) E n (u) := J2 Vn,ku k = g{u)n^ u \l + 0(n~ £ )) 

k 

uniformly for \u— 1| <£o, u£C, Eq > 0. Here g and / are analytic functions 
in \u — 1| < £0 and satisfy g(l) = f(l) = 1. 

Condition II (Regularity condition for the expected profile polynomial). 
The estimate 

(4.2) \E n (v)\=0(n 1 - e ) 

holds uniformly for {u G C : 1 — E\ < \u\ < 1 + e\} \ {u £ C : \u — 1| < £0}, 
where < e± < £q; see Figure 2 for a plot. 
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012 




Fig. 1. Simulated histograms for Wn and Y n ^L n j + e for £ — — 1, 0, 1, where n — 10 7 (top), 
for which the expected width is asymptotically reached at [L n \ — 1 ({L n } < 1 — -y), and 
n = 404960 (bottom), for which the expected width is asymptotically reached at YL n \ 
({L n } >l-j); see (1.4) and (3.7). 

Condition III (Asymptotic estimates for the central moments). The 
central moments of Y n ^ satisfy 

(4.3) E{(y n , fc - = 0{\ A\ m L~ m ^ k ) (m > 0) 
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uniformly for |A| = o{L n ). 

Theorem 4.1 (Width regularity of general random log trees). Under 
Conditions I, II and III, the width of the random log trees in question sat- 
isfies the properties 



n 



[1 + OiL- 1 )), 



(4.4) 



V2vrcJ 2 L r 

E{| W n - K{W n }\ s } = 0{n s L^ s ' 2 ) (s > 0), 
(\k*-f'(l)L n \>T)=0(T~ B ). 



The last estimate holds for every B > and T > Tq for some Tq > 1. Fur- 
thermore, if inserting a new node to the tree changes the width by at most 
a bounded quantity, then we also have 



(4.5) 



Wn 



nw n } 



1 



almost surely. 



For ease of reference, we will refer to the properties (4.4) by saying that 
the random log trees are width-regular with parameters (f'(l),o~ 2 ). 




Fig. 2. The different regions in the complex u-plane as used by Conditions I and II for 
Theorem 4.1. 
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Proof (Sketch). First, Conditions I and II imply, by standard applica- 
tion of the saddle point method, that 

<«> --vii^^~ +0(|A|VI K 1+ K^)) 

uniformly for |A| < L 2 / 3 , where A := k — f'(l)L n and 

<r=/f'(l) + /"(l). 

Note that to prove the estimate (4.6), we used (4.1) and (4.2) only when 
u = e* e , 9 S R. However, the uniform estimates (4.1) and (4.2) in a complex 
neighborhood of unity also yield, by Cauchy's integral representation, 

(4.7) /Vfc = 0{L~ l ' 2 r~ k n f ^ + r~ k n l ~ £ ) = 0(L;^ 2 r^ k n f ^) 

uniformly for all k = 0, . . . ,n, where r = 1 + o(l). This crude estimate is 
sufficient for our purpose in bounding all error terms involved. 

The remaining proofs follow closely those used for recursive trees, details 
being omitted here. □ 

Note that our proof for the almost sure convergence (4.5) requires the esti- 
mate for E{|W n -E{W n }| 2+£ } for which (4.3) with m = 8 suffices. Similarly, 
the estimate for the expected width needs (4.3) with m = 4. 

4.2. The moments estimates (4-3). Theorem 4.1 reduces the proofs of 
the width-regularity properties to those for the three hard estimates (4.1), 
(4.2) and (4.3). While the two estimates (4.1) and (4.2) are more tree depen- 
dent and may rely on different analytic tools, we indicate in this subsection 
how the moments estimates (4.3) can in many cases be deduced from (4.1) 
and (4.2), coupling with suitable "asymptotic transfer" for the underlying 
bivariate recurrence. 

Recurrence of profile. The profiles of many random log trees that arise 
from data structures, analysis of algorithms and discrete probability are of 
the form (see [2, 8, 28, 32]) 

(4-8) Y n .u= E Y il\,k-i (n>2;k>l), 

l<j<h 

(i) 

with Y n fl = 1 for n > 1, where h>2, the Y n ) k I's are independent copies of 
Y n ^ and the underlying splitting distribution satisfies Y^i<j<h^n,j = n — k 
for some integer k > 0. Physically, the root of such random log trees has 
at most h subtrees, each of which has the same (recursive) structure; the 

(i) 

distribution of the size of the jth subtree of the root is described by j and 
k represents the number of nodes retained at the root. 
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Then the moments of Y n ^ satisfy a recurrence of the form 

(4.9) a n ^ = h ^ Kn,ja>j,k-i + &n,fe, 

0<j<n 

where ir n j = f(I n ,l = j) satisfies J^j^nJ = 1 an d the 6 ni fc's are known. For 
our purpose, we can always assume that b n ^ = for k < and k>n. 

Higher central moments. If Y n j. satisfies the distributional recurrence 

(4.8), then the central moments := E{(K„ jfc -/x nifc ) m } can be recursively 
computed by the recurrence 

r n,k — 11 1Xn ^ r j,k~\^ ^n,k^ 

0<j<n 

where 

Q { ™k--= E E nin,i=ji,...,in, h =j h } 

io+h-\ Mh=m jr'iH \-j h =n-K 

0<ii,...,if l <m 
0<io<m 

x^-i--^liV^(j) (m>2), 

with [j = 

(4.10) V n>fc (j) := ^ fJ-j^k-i ~ fJ-n,k- 

l<£<h 

By Cauchy's integral formula and (4.1), we have 
'|«|=r ~ V n 

|«-1|<£ 



Vn,fc(j) = 7T— f 9 
2m J\ u \=r 



where 

^( u ^) := E 4 J r 

Since X1kk/i = n + O(l), we deduce, by expanding tp(u; x) at u = 1, the 
two estimates 



(4.11) V„ lfc (j) 



f ((\r - 1| + L" 1/2 )L- 1/2 r- fc n/M) (r = 1 + o(l)), 
\0(|A|L- 3/2 n), 
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where the first estimate holds uniformly for all tuples ■ ■ ■ ,jh) an d 1 < 
k < n, and the second holds for all tuples (ji,---,jh) and A = O(yfL^). 
Note that if we take r to be the solution near unity of the equation rf'(r) = 
k/L n = /'(l) + A/L n , then r = 1 + A/(a 2 L n ) + 0(A 2 /L 2 n ) and 

(4.12) r -fc n /(r) =ne -A 2 /(2a 2 L ?l )+0(|A| 3 L- 2 ) 

2/3 

uniformly for A = 0(L n )• This means that the first estimate in (4.11) is 
not tight when A = o(y/L n ), which is the reason why we need the second 
estimate. 

Once the precise estimates for V n fc(j) are available, the remaining proof 
for (4.3) is then reduced to the derivation of suitable "asymptotic transfer" 
for the recurrence (4.9) via which one deduces estimate for a n ^ from that 
for b Ut k- Instead of further abstraction for general random log trees, we will 
give more details for specific trees below. 

5. Random quad trees and grid trees. We start from quad trees, which 
are useful data structures for spatial points, and then indicate the estimates 
needed for the more general grid trees proposed in [8]. We show that both 
classes of trees are width-regular. 

Random quad trees and their construction. Quad trees were proposed by 
Finkel and Bentley [15]. The first probabilistic analysis of the typical depth 
of a node, the expected profile, the height, and the partial match cost was 
carried out by Flajolet, Gonnet, Puech and Robson [16, 17] (work carried 
out in 1988), Devroye and Laforest [9] and Flajolet, Labelle, Laforest and 
Salvy [18]. 

Given a sequence of n points independently and uniformly chosen from 
[0, l] d , the random (point) quad tree associated with this random sample 
is constructed by placing the first point at the root, which splits the space 
into 2 d hyperrectangles, each corresponding to one of the 2 d subtrees of the 
root. Points that fall in each hyperrectangle are directed to the corresponding 
subtree and are constructed recursively. For more information on quad trees, 
see [18, 28, 32, 41] and the references therein. 

The profile. By such a construction, the profile Y n ^ satisfies (4.8) with 
h = 2 d , k=1 and 

7r n j := P(/ n ,i = ji, . . . , I n 2 d = j 2 d) 

n — 1 

I A n f n b i {i-x i )+(i-b i )x)\ dx, 

"'[0,l] d \<e<2 d \l<i<d ) 
l-l = (b U ...,b d )2 



24 L. DEVROYE AND H.-K. HWANG 

where (£>i, . . . ,6^)2 denotes the binary representation of t — 1 [prefixed by 
zeros if [log 2 (^ — 1)J < d — 1] and dx. = dx\ ■ ■ ■ dx^. 

The underlying recurrence. From the expression for vr n j, it follows that 
all moments of Y n ,k satisfy (4.9) with (see [18]) 

(5.1) %j = - E ■ ~ (0<J<n). 

72 j<h<-<j d -i<n Jl '" 3d ~ 1 

In particular, the expected profile [i n ^ satisfies the estimates (4.1) and (4.2) 
with f(u) = 2u x l d - 1 and 

1 

g(u) :-- 



T(2u 1 / d ) d {2u 1 / d - 1) 

r(2^ 1 / fii (l - e 2hi l d )) 
X Jf <d T(2-2u 1 / d e 2im / d ) ' 

see [4] and [20]. The exact form of g is less important for our purpose; the 
analyticity of g for u near unity is, however, technically useful. Note that 

Recurrence of P$ := E{(Y riik - fi n , k ) m }. Obviously, P$ = 1, P$ = 



(m) d 

0<j<n 

where 



and P^ fc satisfies the recurrence 



n (m) \- ( m \ 



(io,...,i 2d )&I; 



x V tt .p(n) ... p(v) V io 

^nj-^i.fc-l ^j 2d ,fc-l V ra,A;UJ 



JiH hj 2 d=n-l 

Here V ni fe( j) is given in (4.10) with h = 2 d there and 

Zm := {(io, • • .,«2"0 6 {0, ... , m} x {0, . . . , m — l} d : io H H 2 <* = m }- 

Following the proof pattern for recursive trees and the discussions in Sec- 
tion 4.2, we prove, based on the estimates (4.11), the two bounds 

(zo) p(m) _/ 0{{\r-l\ m + Ln m/2 )Ln m/2 r- mk n m f^), r=l + o(l), 
1 j n > k \0(|A|-L; 3m/2 n-), 

the first being uniform for 1 < k < n and the second uniform for A := k — 
f'{l)L n = 0(y/L^). These two estimates imply (4.3) by (4.7) and (4.12). 
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An asymptotic transfer for the double-indexed recurrence. To justify the 
width-regularity properties (4.4), it remains to prove the two estimates in 
(5.2). For an exact solution for (4.9) similar to (2.9), see [18]. Here we use a 
different inductive argument, which is easily amended for other varieties of 
trees. 



Lemma 1. Assume that a nk satisfies (4.9) with 7r n j given in (5.1) and 
that a nfi ,a ljk = 0(1). If 

\b n , k \<c\k-f(l)L n \ x LPp- k n a 

for n > 1 and 1 < k < n, where A > 0, G R, c> and the two real numbers 
a, p > satisfy p < ((a + l)/2) d , then 

(5.3) \a n>k \ < C \k - f'(l)L n \ x L^p- k n a 

for n > 1 and 1 < k < n, where Cq > is chosen so large that Cq > c/(l — 
e-2 d p/{a + l) d ). 

Proof. We apply induction on k and n. The boundary conditions are 
easily checked by taking Co sufficiently large. We may assume that \k — 
f'(l)L n \ — ► oo, for otherwise we need only modify the value of c. By the 
induction hypothesis, we have (see [4]) 

K,k\ <c\k- f'(l)L n \ x LPp- k n a 

+ 2 d C p 1 ~ k n- 1 J2 1 ■ 

l<J<Ji<-<id-i<n Jl-'-Jd-l 

~c\k-f\\)L n \ x Lip- k n a 

= cn a Lip- k + —^—C,{1 + o(l))\k - f'(l)L n \ x L^ k n a ; 
[a + l) a 

thus (5.3) follows by properly tuning Cq [since p < ((a + l)/2) d ]. □ 

Asymptotics of ' . We prove by induction the first bound in (5.2). 
Assume m>2. 

Consider Q^j, ■ As for recursive trees, we distinguish between two cases. 
If j\, . . . ,j 2 d > n/L™, then Lj e ~ L n for i = 1, . . . , 2 d and we have 



E 



m 

10, ■■■,l2d 
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( 5 - 4 ) >< E ^j^_r--^liV^(j) 

n/L%<ji,...j 2 d<n 
= 0{(\r - l\ m + L- m l 2 )L- m / 2 r- mk n m ^ r) ). 

We now assume that one of the j^s, say j\, is less than n/L™. We may 
furthermore assume that the corresponding index i\ of j\ is nonzero; for 
otherwise, if all ie = for those jg's with ji < n/L™, then the bound on the 
right-hand side of (5.4) obviously holds, since all other jgs satisfy Lj e ~ L n . 

Terms in with i\ > 1 and ji < n/L™ are bounded above by 
o(r~ mk J2 n ( "^ il)/(r) E TTn^ji 1 ^ ] =0(L^ OT r- mfe n m/ W). 
This proves that 

= 0((|r - l| m + L- m l 2 )L- m l 2 r~ mk n m f^). 

Thus the first estimate in (5.2) holds by applying the O-transfer of Lemma 1. 
The proof of the second estimate in (5.2) follows by applying the same in- 
ductive argument; the details are omitted here. 

Consequently, random quad trees are width-regular with parameters (2/d, 
2/d 2 ) and with the almost sure convergence (4.5). All our results are new 
except when d=l, for which quad trees reduce to binary search trees and 
the almost sure convergence in (4.4) was derived in [3], and the expected 
width in [13] (with a weaker error term). 

Random grid trees. Grid trees were first proposed by Devroye [8] and 
represent one of the extensions of quad trees. Instead of placing the first 
element in the given sequence at the root (as in quad trees), we fix an integer 
m > 2 and place the first m—1 elements at the root, which then split the 
space into m d hyperrectangles (called grids). The remaining construction is 
similar to that for quad trees. 

In this case, we have h = m d and (Jq := j, jd := n — m + 1) 

Err v m—i / 
11 



(jt+m-l 

3<3l<-<3d-i<n-m+ll<l<d V m-1 



and (4.1) and (4.2) hold by applying the approach proposed in [4], where 
f(u) satisfies 



((/(«) + !)••• (f(u) +m- l)) a = m\ d u (m>2;d> 1), 
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with /(l) = 1. An O-transfer similar to that given in Lemma 1 can also be 
derived by noting that 

£ 7 r nj |A;-l-/ , (l)L i | A Lfr 

l<j<n 



E \k-i-ni) Lj \^r n tt^M 



l<j<il<-<Jd-l<n-m+l 1<£<<2 V m -l 



(m- l) c 



)1 

m-2 



E 



jd-i 



l<h-i<n Jd ~ lK U 

E E ^ ( ^ N " 

l<Jd-2<Jd-l 1<J1<J2 J1 ^ J2 

m-2 



E ^-/'(l^f^rji 

i<?'<n 



Jl 



\ r(m + a) / 

so that the same type of asymptotic transfer there holds when a, p > satisfy 
the inequality 

f(a+ l)---(a + m- 
P< j 

Thus random grid trees are width-regular with 



1 , # 



(2) 



0" 



2 -"771 



d(H m -l)' d?(H m -lf 
and with the almost sure convergence (4.5), where H m :=Hi<j<m Vi an d 

ffm := Ei<j< m l/i 2 ' Note that d = 1 corresponds to m-ary search trees 
(see [32]), and~m = 2 corresponds to quad trees. No martingale structure is 
known for grid trees for general (m,d). Our results are new. 

6. Generalized m-ary search trees. The m-ary search tree, proposed by 
Muntz and Uzgalis [38], generalizes the binary search tree. For the random 
version built from a random permutation of {1, . . . ,n}, early results on the 
typical depth and expected profile are from [33, 34]. These trees in turn led 
to the generalized m-ary search trees of Hennequin [23] (see also [5] ) . Instead 
of placing the first m — 1 elements in the given sequence of numbers at the 



28 



L. DEVROYE AND H.-K. HWANG 



root (as in m-ary search trees), we choose a random sample of m(t + 1) — 1 
elements, where m > 2 and t > 0, and sort it in increasing order. Then use the 
(t + l)st, the 2{t + l)st, ... and the (m — l)(i + l)st smallest elements in the 
sample to partition the original sample into m groups, corresponding to the 
m subtrees of the root. Elements that fall in each subtree are constructed 
recursively in the same way and the process stops as long as the subtree 
size is less than m(t+ 1) — 1, which can then be arranged arbitrarily, since 
asymptotically this will have a limited effect. 

In this case, the profile Y n satisfies (4.8) with h = m and 



"(-^n,l — jli • • • 1 1n,m — j 



■■■<?? 

n.m — Jm) — / n 



\m(t+l)-l) 

Furthermore, (4.1) and (4.2) hold with f(u) satisfying the equation (see [32]) 

(/(«) + t + 1) • • • (/(«) + m(t + 1) - 1) = (m { f^y )l u, 

with f(l) = 1, where m>2 and t > 0; see [5, 6, 7] for the asymptotic tools 
needed (based on differential equations). Straightforward computation gives 

rr(2) _ rr(2) 



H m {t+i) - Ht+i ' {H m (t+i) - Ht+i) z 

The estimate (4.3) can be checked by an inductive argument similar to 
quad trees by using the expression 

n-l-j \ 
\tl\(m-l)(t+i)-l) 
^nj — 7 n \ ■ 
\m(t+l)-l) 

In particular, we can derive an O-transfer similar to Lemma 1 with the two 
numbers a,p there satisfying 

(a + 1 + 1) • ■ • (a + m(t + 1) - 1) 
P< (t + 2)---(m(t + l)) ' 

Thus the generalized m-ary search trees are also width-regular with the al- 
most sure convergence (4.5). 

Note that m-ary search trees correspond to t = and that m = 2 reduces 
to the so-called fringe-balanced or median-of-(2t + 1) binary search trees; 
see [8]. 

7. Random increasing trees. Bergeron, Flajolet and Salvy [2] proposed 
the increasing tree model: rooted trees with increasing labels along paths 
down from the root, such that the number of trees of a certain structure 
is prescribed in some general manner. The exponential generating function 
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t{z) := J2n T n>i zn / n ' f° r the number r n of increasing trees (of n labels) has 
the form 

(7.1) r'(z)=^r(z)), 

with t(0) = and r(l) = 1 for some function (j){w) with (j)(0) = 1 and non- 
negative Taylor coefficients. The degree function (ft(w) specifies how the trees 
are recursively formed. In this case, there are three representative varieties 
of increasing trees: (i) recursive trees with 4>(w) = e w ; (ii) binary increasing 
trees with (f>{w) = (1 + w) 2 ; (iii) plane-oriented recursive trees (PORTs) with 

4>(w) = 1/(1-10). 

We already studied the width of random recursive trees and random bi- 
nary increasing trees (identically distributed as random binary search trees) . 
We consider first PORTs and then mention other varieties of increasing trees 
(in some generality). 

Random PORTs. PORTs are ordered (or plane) increasing trees without 
restriction on the degree of each node. To the best of our knowledge, such 
trees first appeared in a combinatorial form in [40], and in a more general 
probabilistic form in [44]; see also [2, 39, 43] for more details. Probabilistic 
properties of PORTs received much recent attention due partly to their close 
connection to random complex models; see [26] for more references. 

The recurrence for the profile Y n ,k is similar to (2.1), but with a very 
different underlying distribution (see [26]) 

Y n ,k = + Y:_ Intk (n>2;k> 1), 

where the Y* k 's are independent copies of Y n ^ and 

2 /2j-2w2n-2i-2x 

*nj = m = j) = ^.l^r n )- 

An-l) 

We have 

-<"»-(r^(^)" , '' +1,/2+1 ) (I+ ° ( ' re)> 

uniformly for |u| < C for any C > 0; see also [2, 31]. Then we have (4.1) 
with f(u) = (u + l)/2, so that f'(l) = a 2 = 1/2. Note that although the 
recurrence satisfied by Y n ^ is not of the form (4.8), the technicalities are 
similar to those for recursive trees; see [26] for details. Thus PORTS are 
width-regular with parameters (1/2,1/2); the almost sure convergence (4.5) 
also holds. 

The widths and profiles of random increasing trees for which l/(f)(w) 
equals a polynomial also exhibit similar behaviors. 
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Polynomial varieties. We now show that the same width-regularity re- 
sults (4.4) also hold for polynomial varieties of increasing trees; see [2]. 
Briefly, these are increasing trees in which each node has at most d sub- 
trees and the exponential generating function t(z) := J2 n >i T nZ n /n\ satisfies 
(7.1) with 

<Kw):= Yl fa™* ( d > 2 )> 

0<j<d 

where <pj > for < j < d and </>o, <pd > 0. In this case, it is known that 

(7 ' 2) x R -n n -( d -2)/(d-2) {1 + 0(n -2/( d -l) )); 

where p denotes the period of 4>(v), R := J °° dv/<fi{v) and 

£E{Y n , fc }^ = (r' [ Z {r\v)) l - U dv; 

see [2]. From these relations, we deduce the two estimates (4.1) and (4.2) 
with 

f( \ d 1 



d-l d-r 

g{u)=^~ u),{d ~ 1) {R{d-l)f^l {d ^ 

r(i/(d-i)) r°° , u , 
x r {du /( d -i)) i dv - 

Furthermore, the higher moments of Y n & (centered or not) satisfy the re- 
currence 

l<j<n 

where 

^nj== ( "~ 1 ? !Tj [^- 1 - j y(r(z)). 

By (7.2), we then derive an O-transfer for a n ^ similar to Lemma 1 with a 
and p there satisfying 

d-l / 1 



c^" 1 V d-l 

from this the estimates (4.3) are then justified, implying the width-regularity 
properties (4.4). 
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Random mobile trees. These are increasing trees in which all subtrees are 
arranged in cyclic order and whose enumerating generating function satisfies 
(7.1) with (j){w) = 1 — log(l — w); see [2]. This example is less natural but 
very interesting because nodes are distributed in a rather different way and 
the trees are not width-regular. 

First, the generating polynomial for the expected profile is given by 

E n (u) = Y.Vn,kU k = -[z n ]r'(z) u rr'ivf^dv. 
k Tn Jo 

Here the number r n of such trees satisfies 

Tn 



ni 



R^n-^l + OiL- 1 )), 



where R = f£°(l + v) l e v dv. By singularity analysis (see [19]), we deduce 
that 



E n (u)=g(u)nL u n - i (l + 



log K 



where the O-term holds uniformly for bounded complex u and 

/•OO 

g(u) = R- 1 u e- v (l+v)- u dv. 



Note that this is not of the form (4.1) and g(l) = 1. Thus such mobile trees 
are very "bushy" at each level (the root already having about n/L n nodes) 
and we have 



n 



max/i„ ifc 

k y/2'K\0gL n 



the mode being reached at k ~ log L n . The same methods of proof we used 
for recursive trees can be extended to show that 

n 

V27T log L n 

a very different behavior from all types of random trees we have discussed. 
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